Inhibitors of Apoptosis
INTRODUCTION 

Field of the Invention

The field of this invention is human proteins involved in the inhibition of 
apoptosis, or programmed cell death. 

Background 

Cellular apoptosis, or programmed cell death, may be initiated by a variety of
different stimuli, including viral infection, certain cell-culture conditions,
cell-cell signaling and cytokines. Elucidation of the signal transduction pathways
leading to apoptosis would provide valuable insight into a variety of pathogenic
mechanisms. Accordingly, the ability to exogenously modulate the induction of
apoptosis would yield therapeutic applications for numerous clinical indications.
In addition, components of such pathways would provide valuable targets for
automated, cost-effective, high-throughput drug screening and hence would have
immediate application in domestic and international pharmaceutical and
biotechnology drug development programs.

Relevant Literature 

Rothe et al. (1994) Cell 78, 681-692, report the existence of tumor necrosis
factor (TNF) receptor associated proteins which co-immunoprecipitate with a TNF
receptor; see also Rothe et al., pending US patent application Serial No. 08/446,915.
Roy et al. (1995) Cell 80, 167-178, disclose the gene for a human neuronal apoptosis
inhibitory protein. Birnbaum et al. (1994) J Virol 68, 2521-2528, disclose an inhibitor
of apoptosis (iap) gene, Op-iap, from the Orgyia pseudotsugata nuclear polyhedrosis
virus (OpMNPV), with sequence similarity to two other viral genes: Cp-iap, derived
from Cydia pomonella granulosis virus (CpGV), and iap, derived from the Autographa
californica nuclear polyhedrosis virus (AcMNPV). Clem and Miller (1994), in
Apoptosis II: The Molecular Basis of Apoptosis in Disease, pp 89-110, Cold Spring
Harbor Laboratory Press, provide a recent review of apoptosis regulation by insect
viruses.



SUMMARY OF THE INVENTION

The invention provides methods and compositions relating to novel human
cellular inhibitor of apoptosis proteins (c-IAPs). The subject proteins comprise a series
of defined structural domain repeats and/or a RING finger domain; in particular, at
least two of a first domain repeat comprising SEQUENCE ID NO: 5 or 6; a second
domain repeat comprising SEQUENCE ID NO: 7 or 8; and a third domain repeat
comprising SEQUENCE ID NO: 9 or 10; and/or a RING finger domain comprising
SEQUENCE ID NO: 11 or 12, or a consensus sequence derived from these human
genes. The proteins provide a c-IAP specific function, with preferred proteins being
capable of modulating the induction of apoptosis; for example, by binding a human
tumor necrosis factor receptor associated factor, TRAF. The compositions include
nucleic acids which encode the subject c-IAPs and hybridization probes and primers
capable of hybridizing with the disclosed c-IAP genes.

The invention includes methods of using the subject compositions in therapy
(e.g. gene therapy to enhance expression of a c-IAP gene), in diagnosis (e.g. genetic
hybridization screens for c-IAP gene mutations), and in the biopharmaceutical industry
(e.g. reagents for increasing yields of recombinant protein by enhancing host cell
survival in culture, for screening chemical libraries for lead compounds for a
pharmacological agent useful in the diagnosis or treatment of disease associated with
apoptosis regulation, etc.).



DETAILED DESCRIPTION OF THE INVENTION

The invention provides methods and compositions relating to novel cellular
inhibitor of apoptosis proteins (c-IAPs). The nucleotide sequence of a natural cDNA
encoding human c-IAP1 is shown as SEQUENCE ID NO:1 and the full conceptual
translate is shown as SEQUENCE ID NO:2. The nucleotide sequence of another
natural cDNA encoding human c-IAP2 is shown as SEQUENCE ID NO:3 and the full
conceptual translate is shown as SEQUENCE ID NO:4. The human c-IAPs of the
invention include incomplete translates of SEQUENCE ID NOS:1 and 3 or deletion
mutants of SEQUENCE ID NOS:2 and/or 4, which translates or deletion mutants
have at least one of the human c-IAP specific activities described herein. In addition,
the invention provides nonhuman mammalian homologs of the disclosed human
c-IAPs. These homologs are encoded by natural cDNAs which are capable of
specifically hybridizing with one or more of the disclosed human cDNAs under the
hybridization conditions described below and are isolated using the methods and
reagents described herein. For example, the amino acid sequence of a murine
homolog of c-IAP1 and the sequence of its cDNA are shown in SEQUENCE ID NOS:
14 and 13, respectively.

The subject proteins comprise a series of defined structural domain repeats
and/or a RING finger domain shown to be necessary for human c-IAP specific
function; generally including at least two of: a first domain repeat comprising
SEQUENCE ID NO: 5, 6 or a consensus of 5 and 6, a second domain repeat
comprising SEQUENCE ID NO: 7, 8 or a consensus of 7 and 8, and a third domain
repeat comprising SEQUENCE ID NO: 9, 10 or a consensus of 9 and 10; and/or a
RING finger domain comprising SEQUENCE ID NO: 11, 12 or a consensus of 11 and
12. Preferred domain repeat-containing c-IAPs contain each of the three domain
repeats. More preferred c-IAPs comprise the three domain repeats and the C-terminal
RING finger. To secure or optimize the requisite function for the protein, the repeats
are usually preceded (N-terminally) and separated by intervening regions of about 10
to about 100 residues, which regions preferably derive from those found in the natural
c-IAP1 and c-IAP2 translates. Similarly, the RING finger domain of RING finger
domain-containing c-IAPs is usually preceded by an N-terminal
region of about 10 to 300 residues, usually 100 to 300 residues, which region
preferably derives from those found in the natural c-IAP1 and c-IAP2 translates.

The proteins provide a human c-IAP1 or c-IAP2 (c-IAP1/2) specific activity or
function which may be determined by convenient in vitro, cell-based, or in vivo
assays. Preferred proteins are capable of modulating the induction of apoptosis. Such
activity or function may be demonstrated in cell culture (e.g. cell transfections) or in
animals (e.g. in vivo gene therapy, transgenics). c-IAP1/2 specific function can also
be demonstrated by specific binding to a c-IAP1/2 specific binding target, including
natural binding targets and nonnatural targets such as c-IAP1/2 specific antibodies.
For example, c-IAPs comprising at least two of SEQUENCE ID NOS: 6, 7 and 8 are
capable of specifically binding human tumor necrosis factor receptor associated factors
1 and 2 (TRAF1 and TRAF2) in simple in vitro binding assays. Finally, specific
function can be assayed immunologically by the ability of the subject protein to elicit a
c-IAP1/2 specific antibody in a rodent or rabbit. Generally, human c-IAP1/2
specificity of the binding agent is shown by binding equilibrium constants (usually at
least about 10⁷ M⁻¹, preferably at least about 10⁸ M⁻¹, more preferably at least about
10⁹ M⁻¹). A wide variety of cell-based and cell-free assays may be used to demonstrate
human c-IAP1/2-specific binding; preferred are rapid in vitro, cell-free assays such as
mediating or inhibiting human c-IAP1/2-protein (e.g. human c-IAP1-TRAF2) binding,
immunoassays, etc.
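
By way of illustration only, the binding equilibrium (association) constants recited above may be restated as dissociation constants using the standard reciprocal relationship K_D = 1/K_A. The following sketch assumes that relationship applies to a simple one-to-one binding equilibrium; the function and variable names are hypothetical and are not part of the disclosure.

```python
def dissociation_constant_molar(ka_per_molar: float) -> float:
    """Return K_D in molar for an association constant K_A given in M^-1."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return 1.0 / ka_per_molar


# Association-constant thresholds recited in the text (M^-1).
KA_THRESHOLDS_PER_MOLAR = {
    "usual": 1e7,
    "preferred": 1e8,
    "more preferred": 1e9,
}

# Equivalent dissociation constants, expressed in nanomolar.
kd_nanomolar = {
    label: dissociation_constant_molar(ka) * 1e9
    for label, ka in KA_THRESHOLDS_PER_MOLAR.items()
}

if __name__ == "__main__":
    for label, kd in kd_nanomolar.items():
        print(f"{label}: K_D at or below about {kd:g} nM")
```

Under this assumption, a higher association constant corresponds to a lower dissociation constant, so the "more preferred" binding agents are those that bind most tightly.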

The claimed human c-IAP proteins are isolated, partially pure or pure and are
typically recombinantly produced. An "isolated" protein, for example, is
unaccompanied by at least some of the material with which it is associated in its
natural state and constitutes at least about 2%, and preferably at least about 5% by
weight of the total protein in a given sample; a partially pure protein constitutes at
least about 10%, preferably at least about 30%, and more preferably at least about 60%
by weight of the total protein in a given sample; and a pure protein constitutes at least
about 70%, preferably at least about 90%, and more preferably at least about 95% by
weight of the total protein in a given sample. A wide variety of molecular and
biochemical methods are available for generating and expressing the subject
compositions, see e.g. Molecular Cloning, A Laboratory Manual (Sambrook, et al.,
Cold Spring Harbor Laboratory), Current Protocols in Molecular Biology (Eds.
Ausubel, et al., Greene Publ. Assoc., Wiley-Interscience, NY) or that are otherwise
known in the art.
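
For illustration, the minimum weight-percent thresholds recited above for "isolated", "partially pure" and "pure" protein can be expressed as a simple classification. This is a sketch only: it treats the "at least about" figures as hard cutoffs, ignores the further requirement that an isolated protein be unaccompanied by some of its naturally associated material, and uses a hypothetical function name.

```python
def purity_grade(weight_percent: float) -> str:
    """Classify a preparation by the subject protein's share of total protein.

    Uses the minimum thresholds from the text: about 2% (isolated),
    about 10% (partially pure) and about 70% (pure), by weight.
    """
    if not 0.0 <= weight_percent <= 100.0:
        raise ValueError("weight percent must be between 0 and 100")
    if weight_percent >= 70.0:
        return "pure"
    if weight_percent >= 10.0:
        return "partially pure"
    if weight_percent >= 2.0:
        return "isolated"
    return "below isolated threshold"


if __name__ == "__main__":
    for sample in (1.0, 5.0, 30.0, 95.0):
        print(f"{sample:5.1f}% by weight -> {purity_grade(sample)}")
```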

The invention provides human c-IAP1/2-specific binding agents including
substrates, natural intracellular binding targets, etc. and methods of identifying and
making such agents, and their use in diagnosis, therapy and pharmaceutical
development. For example, human c-IAP1/2-specific agents are useful in a variety of
diagnostic and therapeutic applications, especially where disease or disease prognosis
is associated with improper utilization of a pathway involving human c-IAP1/2, e.g.,
apoptosis. Novel human c-IAP1/2-specific binding agents include human
c-IAP1/2-specific antibodies and other natural intracellular binding agents identified
with assays such as one- and two-hybrid screens, non-natural intracellular binding
agents identified in screens of chemical libraries, etc.

The invention also provides nucleic acids encoding the subject proteins, which
nucleic acids may be part of human c-IAP1/2-expression vectors and may be
incorporated into recombinant cells for expression and screening, transgenic animals
for functional studies (e.g. the efficacy of candidate drugs for disease associated with
c-IAP1/2 mediated signal transduction), etc., and nucleic acid hybridization probes and
replication/amplification primers having a human c-IAP1/2 cDNA specific sequence
contained in SEQUENCE ID NO:1 or 3. Nucleic acids encoding human c-IAP1/2 are
isolated from eukaryotic cells, preferably human cells, by screening cDNA libraries
with probes or PCR primers derived from the disclosed human c-IAP1/2 cDNA.

In addition, the invention provides nucleic acids sharing sufficient sequence
similarity with that of the disclosed human c-IAP1/2 cDNAs to effect hybridization
thereto. Such human c-IAP1/2 cDNA homologs are capable of hybridizing to the
human c-IAP1/2-encoding nucleic acid defined by SEQUENCE ID NO: 1 or 3 under
stringency conditions characterized by a hybridization buffer comprising 30%
formamide in 5 x SSPE (0.18 M NaCl, 0.01 M NaPO4, pH 7.7, 0.001 M EDTA) buffer
at a temperature of 42 °C and remaining bound when subject to washing at 42 °C with
0.2 x SSPE. Preferred nucleic acids will hybridize in a hybridization buffer
comprising 50% formamide in 5 x SSPE buffer at a temperature of 42 °C and remain
bound when subject to washing at 42 °C with 0.2 x SSPE buffer. Human
c-IAP1/2 cDNA homologs can also be characterized by BLASTX (Altschul et al.
(1990) Basic Local Alignment Search Tool, J Mol Biol 215, 403-410) probability
scores. Using this nucleic acid sequence search program BLASTX, complete coding
region human c-IAP1/2 cDNA homologs provide a Probability P(N) score of less than
1.0e-200. More preferred nucleic acids encode c-IAPs with at least about 50%,
preferably at least about 60%, more preferably at least 70% pair-wise identity to at
least one of SEQUENCE ID NOS: 2 and 4.
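
As a non-limiting illustration of the pair-wise identity criterion, the following sketch computes percent identity from two sequences that have already been aligned. Several conventions for the denominator exist and the text does not specify one; this sketch assumes identity is counted over all alignment columns in which at least one sequence has a residue. The function name and the short toy sequences are hypothetical and are not taken from the disclosed sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a column counts
    as a match only when both sequences carry the same residue.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Toy aligned peptides (hypothetical, for illustration only).
TOY_A = "MKT-AYIAK"
TOY_B = "MKTQAYLAK"

if __name__ == "__main__":
    identity = percent_identity(TOY_A, TOY_B)
    print(f"pair-wise identity: {identity:.1f}%")
    print("meets 70% criterion:", identity >= 70.0)
```

In practice the alignment itself would be produced by a dedicated alignment program; only the counting step is sketched here.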

The subject nucleic acids are isolated, i.e. constitute at least about 0.5%,
preferably at least about 5% by weight of total nucleic acid present in a given fraction.
The subject nucleic acids find a wide variety of applications including use as
translatable transcripts, hybridization probes, PCR primers, therapeutic nucleic acids,
etc.; use in detecting the presence of human c-IAP1/2 genes and gene transcripts, in
detecting or amplifying nucleic acids encoding additional human c-IAP1/2 homologs
and structural analogs, and in gene therapy applications. When used as expression
constructs, the nucleic acids are usually recombinant, meaning they comprise a
sequence joined to a nucleotide other than that which it is joined to on a natural
chromosome. The subject nucleic acids may be contained within vectors, cells or
organisms.

In diagnosis, c-IAP1/2 hybridization probes find use in identifying wild-type
and mutant c-IAP1/2 alleles in clinical and laboratory samples. Mutant alleles are
used to generate allele-specific oligonucleotide (ASO) probes for high-throughput
clinical diagnoses. In therapy, therapeutic c-IAP1/2 nucleic acids are used to modulate
cellular expression or intracellular concentration or availability of active c-IAP1/2. A
wide variety of indications may be treated, either prophylactically or therapeutically,
with the subject compositions. For example, where cell-specific apoptosis or other
limitation of cell growth is desired, e.g. neoproliferative disease, a reduction in
c-IAP1/2 expression is effected by introducing into the targeted cell type c-IAP1/2
nucleic acids which reduce the functional expression of c-IAP1/2 gene products (e.g.
nucleic acids capable of inhibiting translation of a c-IAP1/2 protein). Conditions for
treatment include restenosis, where vascular smooth muscle cells are involved,
inflammatory disease states, where endothelial cells, inflammatory cells and
glomerular cells are involved, myocardial infarction, where heart muscle cells are
involved, glomerular nephritis, where kidney cells are involved, transplant rejection
where endothelial cells are involved, infectious diseases such as HIV infection where
certain immune cells and other infected cells are involved, or the like.

These c-IAP1/2 inhibitory nucleic acids are typically antisense: single-stranded
sequences comprising complements of the disclosed c-IAP1/2 encoding nucleic acid.
Antisense modulation of the expression of a given c-IAP1/2 protein may employ
c-IAP1/2 antisense nucleic acids operably linked to gene regulatory sequences. Cells
are transfected with a vector comprising a c-IAP1/2 sequence with a promoter
sequence oriented such that transcription of the gene yields an antisense transcript
capable of binding to endogenous c-IAP1/2 protein encoding mRNA. Transcription of
the antisense nucleic acid may be constitutive or inducible and the vector may provide
for stable extrachromosomal maintenance or integration. Alternatively,
single-stranded antisense nucleic acids that bind to genomic DNA or mRNA encoding a
given c-IAP1/2 protein may be administered to the target cell, in or temporarily
isolated from a host, at a concentration that results in a substantial reduction in
expression of the targeted protein.

In other indications, e.g. certain hypersensitivities, atrophic diseases, etc., a
reduction in apoptosis is desired. In these applications, an enhancement in c-IAP1/2
expression is effected by introducing into the targeted cell type c-IAP1/2 nucleic acids
which increase the functional expression of c-IAP1/2 gene products. Conditions for
treatment include multiple sclerosis, where certain neuronal cells are involved,
inflammatory disease states such as rheumatoid arthritis, where bystander cells are
involved, transplant rejection where graft cells are involved, infectious diseases such
as HIV infection where certain uninfected host cells are involved, or the like. Such
nucleic acids may be c-IAP1/2 expression vectors, vectors which upregulate the
functional expression of an endogenous c-IAP1/2 allele, or replacement vectors for
targeted correction of c-IAP1/2 mutant alleles.

Various techniques may be employed for introducing the nucleic acids into
viable cells. The techniques vary depending upon whether one is using the subject
compositions in culture or in vivo in a host. Various techniques which have been
found efficient include transfection with a retrovirus and viral coat protein-liposome
mediated transfection; see Dzau et al., Trends in Biotech 11, 205-210 (1993). In some
situations it is desirable to provide the nucleic acid source with an agent which targets
the target cells, such as an antibody specific for a surface membrane protein on the
target cell, a ligand for a receptor on the target cell, etc. Where liposomes are
employed, proteins which bind to a surface membrane protein associated with
endocytosis may be used for targeting and/or to facilitate uptake, e.g. capsid proteins
or fragments thereof tropic for a particular cell type, antibodies for proteins which
undergo internalization in cycling, proteins that target intracellular localization and
enhance intracellular half-life. In liposomes, the nucleic acid concentration in the
lumen will generally be in the range of about 0.01 µM to 10 µM. For other
techniques, the concentration and application rate are determined empirically, using
conventional techniques to determine desired ranges.

Application of the subject therapeutics may be systemic or local, i.e.
administered at the site of interest. Various techniques can be used for providing the
subject compositions at the site of interest, such as injection, use of catheters, trocars,
projectiles, pluronic gel, stents, sustained drug release polymers or other device which
provides for internal access. Where an organ or tissue is accessible because of
removal from the patient, such organ or tissue may be bathed in a medium containing
the subject compositions, the subject compositions may be painted onto the organ, or
may be applied in any convenient way. Systemic administration of the nucleic acid
may be effected using naked DNA, lipofection, or liposomes with tissue targeting (e.g.
antibody).

The invention provides methods and compositions for enhancing the yield of
many recombinantly produced proteins, such as tissue plasminogen activator (t-PA),
by increasing maximum cell densities and survival time of host production cells in
culture. Specifically, cultured cells are transfected with nucleic acids which effect the
up-regulation of endogenous c-IAP or the expression of an exogenous c-IAP. For
example, nucleic acids encoding functional c-IAP operably linked to a transcriptional
promoter are used to over-express the exogenous c-IAP in the host cell (see the
experimental section, below). Such transformed cells demonstrate enhanced survival
ability at elevated cell densities and over extended culture periods.

The invention provides efficient methods of identifying pharmacological
agents or lead compounds for agents active at the level of a human c-IAP1/2
modulatable cellular function, particularly human c-IAP1/2 mediated signal
transduction, especially in apoptosis. Generally, these screening methods involve
assaying for compounds which modulate a human c-IAP1/2 interaction with a natural
c-IAP1/2 binding target. The methods are amenable to automated, cost-effective
high-throughput screening of chemical libraries for lead compounds. Identified reagents
find use in the pharmaceutical industries for animal and human trials; for example, the
reagents may be derivatized and rescreened in in vitro and in vivo assays to optimize
activity and minimize toxicity for pharmaceutical development. Target indications
may include infection, genetic disease, cell growth and regulatory dysfunction, such as
neoplasia, inflammation, hypersensitivity, etc.

A wide variety of assays for binding agents are provided including
protein-protein binding assays, immunoassays, cell based assays, etc. The human c-IAP1/2
compositions used in the methods are usually added in an isolated, partially pure or pure
form and are typically recombinantly produced. The human c-IAP1/2 may be part of a
fusion product with another peptide or polypeptide, e.g. a polypeptide that is capable
of providing or enhancing protein-protein binding, stability under assay conditions
(e.g. a tag for detection or anchoring), etc. The assay mixtures comprise a natural
intracellular human c-IAP1/2 binding target such as a TRAF. While native binding
targets may be used, it is frequently preferred to use portions (e.g. peptides, nucleic
acid fragments) thereof so long as the portion provides binding affinity and avidity to
the subject human c-IAP1/2 conveniently measurable in the assay. The assay mixture
also comprises a candidate pharmacological agent. Candidate agents encompass
numerous chemical classes, though typically they are organic compounds, preferably
small organic compounds, and are obtained from a wide variety of sources including
libraries of synthetic or natural compounds. A variety of other reagents may also be
included in the mixture. These include reagents like salts, buffers, neutral proteins,
e.g. albumin, detergents, etc. which may be used to facilitate optimal binding and/or
reduce non-specific or background interactions, etc. Also, reagents that otherwise
improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors,
antimicrobial agents, etc. may be used.

The resultant mixture is incubated under conditions whereby, but for the
presence of the candidate pharmacological agent, the human c-IAP1/2 specifically
binds the cellular binding target, portion or analog with a reference binding affinity.
The mixture components can be added in any order that provides for the requisite
bindings. Incubations may be performed at any temperature which facilitates optimal
binding, typically between 4 °C and 40 °C, more commonly between 15 °C and 40 °C.
Incubation periods are likewise selected for optimal binding but also minimized to
facilitate rapid, high-throughput screening, and are typically between 0.1 and 10 hours,
preferably less than 5 hours, more preferably less than 2 hours.

After incubation, the agent-influenced binding between the human c-IAP1/2
and one or more binding targets is detected in any convenient way. For cell-free
binding type assays, a separation step is often used to separate bound from unbound
components. Separation may be effected by precipitation (e.g. TCA precipitation,
immunoprecipitation, etc.), immobilization (e.g. on a solid substrate), etc., followed by
washing by, for example, membrane filtration (e.g. Whatman's P-81 ion exchange
paper, Polyfiltronic's hydrophobic GFC membrane, etc.), gel chromatography (e.g. gel
filtration, affinity, etc.). In addition, one of the components usually comprises or is
coupled to a label. A wide variety of labels may be employed - essentially any label
that provides for detection of bound protein. The label may provide for direct
detection as radioactivity, luminescence, optical or electron density, etc. or indirect
detection such as an epitope tag, an enzyme, etc. A variety of methods may be used
to detect the label depending on the nature of the label and other assay components.
For example, the label may be detected bound to the solid substrate or a portion of the
bound complex containing the label may be separated from the solid substrate, and
thereafter the label detected. Labels may be directly detected through optical or
electron density, radiative emissions, nonradiative energy transfers, etc. or indirectly
detected with antibody conjugates, etc. For example, in the case of radioactive labels,
emissions may be detected directly, e.g. with particle counters or indirectly, e.g. with
scintillation cocktails and counters.

The following experiments and examples are offered by way of illustration and
not by way of limitation.

EXPERIMENTAL 
The murine cellular inhibitor of apoptosis protein 1 (c-IAP1) was
biochemically purified as a TNF-R2 associated protein using coimmunoprecipitation
(Rothe et al. (1994) supra). A large scale protein purification protocol provided material
sufficient for peptide sequencing. Fully degenerate oligonucleotides corresponding to
two of the isolated peptides were used to specifically amplify a 0.75 kb DNA fragment
from mouse CT6 RNA by Reverse Transcription-PCR. This DNA fragment was used
to isolate full-length cDNA clones from a mouse CT6 cDNA library by hybridization
(50% formamide, 5xSSPE, 42 °C; filters washed at 42 °C with 0.2xSSPE, where
1xSSPE is 0.18 M NaCl, 0.01 M NaPO4, pH 7.7, 0.001 M EDTA).
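
For illustration, the component concentrations of the 5xSSPE hybridization buffer and the 0.2xSSPE wash buffer follow from the 1xSSPE composition given above by simple scaling. The sketch below assumes linear scaling of each component and ignores the formamide content and pH adjustment; the names are hypothetical.

```python
from typing import Dict

# 1xSSPE composition as stated in the text (molar).
SSPE_1X_MOLAR: Dict[str, float] = {
    "NaCl": 0.18,
    "NaPO4": 0.01,
    "EDTA": 0.001,
}


def sspe_molarities(strength: float) -> Dict[str, float]:
    """Scale the 1xSSPE component molarities to the requested strength."""
    if strength <= 0:
        raise ValueError("buffer strength must be positive")
    return {name: molar * strength for name, molar in SSPE_1X_MOLAR.items()}


if __name__ == "__main__":
    for strength in (5.0, 0.2):
        parts = ", ".join(
            f"{name} {molar:g} M"
            for name, molar in sspe_molarities(strength).items()
        )
        print(f"{strength:g}xSSPE: {parts}")
```

The low salt of the 0.2xSSPE wash relative to the 5xSSPE hybridization buffer is what makes the wash the more stringent step.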

DNA sequence analysis predicted an open reading frame encoding a 612 amino
acid protein that shows significant sequence similarity (36% amino acid identity) with
the 'inhibitor of apoptosis protein' (IAP) from insect viruses (Clem, R. J. and Miller, L.
K., 1994, supra) and the human 'neuronal apoptosis inhibitory protein' (NAIP) (23%
amino acid identity), which is involved in spinal muscular atrophy (SMA), an inherited
disease in humans (Roy et al., 1995, supra). To obtain the human c-IAP1 gene, the
originally amplified mouse DNA fragment was used as a probe to screen a HeLa
cDNA library (30% formamide, 5xSSPE, 42 °C; filters washed at 42 °C with
0.2xSSPE). Sequence analysis of the isolated cDNA clones revealed that they
correspond to two distinct genes, designated c-IAP1 and c-IAP2. The human c-IAP1
cDNA encodes a protein of 618 amino acids that is 84% identical to murine c-IAP1.
The human c-IAP2 cDNA encodes a protein of 604 amino acids that shares a high
degree of amino acid identity with both murine and human c-IAP1 (72% and 73%,
respectively) and represents another member of the IAP superfamily.

Comparison of the amino acid sequences of members of the IAP superfamily
reveals that they comprise at least three distinct domains. The N-terminal
region of all IAP family members comprises 'baculovirus IAP repeat' (BIR)
motifs (Birnbaum et al., 1994, supra). While the viral proteins contain two repeats, the
mammalian homologs (c-IAP1, -2) possess three BIR motifs. Similarly, NAIP
contains three BIR repeats. In addition to BIR motifs, viral IAPs contain a C-terminal
RING finger motif. This Zn-binding domain is also present in c-IAP1 and -2 but not in
NAIP. Thus c-IAP1 and -2 define a distinct subfamily within the IAP superfamily that
contains three BIR motifs and a RING finger motif. A RING finger domain is also
present at the N-terminus of TRAF2 and has been shown to be involved in TRAF2
signal transduction. The RING finger motifs of c-IAP1 and -2 share significant
sequence homology with the RING finger domains of viral IAPs but no homology
with the TRAF2 RING finger domain besides the conserved cysteine and histidine
residues. The region between the BIR domain and the RING finger domain of c-IAP1
and -2 is strongly conserved but does not reveal any significant homology to other
members of the IAP family or any other proteins in the NCBI database.

A yeast two-hybrid system was used to determine how c-IAP1 and -2 interact
with TNF-R2 and/or TRAFs. The following results were obtained indistinguishably
for c-IAP1 and c-IAP2. Two-hybrid analysis revealed that c-IAP1 does not directly
interact with TNF-R2. However, a direct interaction could be detected between
c-IAP1 and TRAF2. The conserved TRAF domain of TRAF2 (amino acids 264-501)
is sufficient to mediate this interaction. Consistently, c-IAP1 also interacted with
TRAF1. Further analysis demonstrated that the coiled-coil region within the TRAF
domain of TRAF2 (amino acids 251-358) is required for interaction with c-IAP1. In
contrast, the C-terminal region of the TRAF domain (amino acids 359-501) that
mediates the association of TNF-R2 with TRAF2 is dispensable for interaction of
c-IAP1 with TRAF2. Thus c-IAP1 and TNF-R2 bind to non-overlapping docking
sites within the TRAF domain of TRAF2. Consistently, c-IAP1 does not interact with
TRAF3 (e.g. Cheng et al. (1995), supra), which does not contain a coiled-coil region
with sequence similarity to TRAF2/TRAF1. Deletion mutagenesis of c-IAP1
indicated that the N-terminal half of the protein containing the three BIR motifs
(amino acids 1-336 of c-IAP1 and 1-396 of c-IAP2) is sufficient for interaction with
TRAF2 and TRAF1. Similarly, combinations of two of the three BIR motifs, e.g.
amino acid residues 46-99 and 204-249 of c-IAP1 and 29-82 and 189-234 of c-IAP2,
separated by IAP1 derived intervening sequences of varying lengths, are assayed for
TRAF1 and TRAF2 binding. This indicates that BIR motifs represent a novel
protein-protein interaction domain. The RING finger domain of c-IAP1/2 (amino acids
571-618 of c-IAP1 and 557-604 of c-IAP2) is not required for interaction with TRAFs,
but rather mediates subsequent steps in the c-IAP1/2 signaling cascade. Similarly, a
variety of c-IAP1 derived N-terminal leader sequences fused to the c-IAP1 RING
finger domain are used to assay signal transduction mediation. In an analogous
situation, the RING finger domain of TRAF2 has been demonstrated to be required for
TRAF2-mediated activation of NF-κB.

A transfection based co-immunoprecipitation assay was used to investigate
how c-IAP1 interacts with the complex of TNF-R2 and TRAFs. In this system c-IAP1
was N-terminally tagged with a FLAG epitope peptide and expressed in human
embryonic 293 cells under the control of a constitutive CMV promoter (pRK vector).
The c-IAP1 expression vector was transiently co-transfected into 293 cells with
expression vectors for TNF-R2 and TRAFs. After 24-36 h, the cells were harvested
and extracts immunoprecipitated with anti-TNF-R2 antibodies, followed by Western
analysis with anti-FLAG antibodies. This assay demonstrated that while c-IAP1
associates directly with TRAF1 and TRAF2, its interaction with TNF-R2 is indirect
and requires the heterocomplex of TRAF1 and TRAF2. Thus, c-IAP1 is a component
of the TNF-R2 (CD40)/TRAF signaling complex.

To determine the functional properties of c-IAP1, transient transfection assays
were performed in human rhabdomyosarcoma KYM1 cells. The results indicate that
overexpression of c-IAP1, but not of control vector, TRAF1 or TRAF2, protects KYM1
cells from TNF-induced programmed cell death (apoptosis). Hence, c-IAP1 regulates
the cellular response to TNF by modulating TNF responsiveness, e.g. the initiation of
an apoptotic or protective program. The transient transfection assay also finds use as a
drug screening assay. In this application, candidate agents are screened as above for
their ability to modulate the ability of c-IAP1 to downregulate apoptosis.



EXAMPLES 

1. Protocol for human c-IAP1 - TRAF2 binding assay.
A. Reagents:

- Neutralite Avidin: 20 µg/ml in PBS.

- Blocking buffer: 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room
temperature.

- Assay Buffer: 100 mM KCl, 20 mM HEPES pH 7.6, 0.25 mM EDTA, 1%
glycerol, 0.5% NP-40, 50 mM β-mercaptoethanol, 1 mg/ml BSA, cocktail of protease
inhibitors.

- human c-IAP1 10x stock: 10⁻⁸ - 10⁻⁶ M unlabeled human c-IAP1
supplemented with 200,000-250,000 cpm of labeled human c-IAP1 (Beckman
counter). Place in the 4 °C microfridge during screening.

- Protease inhibitor cocktail (1000X): 10 mg Trypsin Inhibitor (BMB #
109894), 10 mg Aprotinin (BMB # 236624), 25 mg Benzamidine (Sigma # B-6506),
25 mg Leupeptin (BMB # 1017128), 10 mg APMSF (BMB # 917575), and 2 mM
NaVO3 (Sigma # S-6508) in 10 ml of PBS.

- TRAF2: 10⁻⁸ - 10⁻⁵ M biotinylated truncated TRAF2 (residues 264-501) in
PBS.

B. Preparation of assay plates:

- Coat with 120 µl of stock N-Avidin per well overnight at 4°C.

- Wash 2 times with 200 µl PBS.

- Block with 150 µl of blocking buffer.

- Wash 2 times with 200 µl PBS.

C. Assay:

- Add 40 µl assay buffer/well.

- Add 10 µl compound or extract.

- Add 10 µl ³³P-human c-IAP1 (20,000-25,000 cpm/0.1-10 pmoles/well = 10⁻⁹ -
10⁻⁷ M final concentration).

- Shake at 25°C for 15 minutes.

- Incubate additional 45 minutes at 25°C.

- Add 40 µl biotinylated truncated TRAF2 (0.1-10 pmoles/40 µl in assay
buffer).

- Incubate 1 hour at room temperature.

- Stop the reaction by washing 4 times with 200 µl PBS.

- Add 150 µl scintillation cocktail.

- Count in Topcount.
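
The concentrations quoted in the assay steps can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the four additions (40 µl assay buffer, 10 µl compound, 10 µl labeled c-IAP1, 40 µl TRAF2) make up the entire well volume, and the function names are hypothetical rather than part of the protocol.

```python
def molar_from_pmol(pmol: float, volume_ul: float) -> float:
    """Convert an amount in pmol dissolved in volume_ul microliters to mol/L."""
    moles = pmol * 1e-12
    liters = volume_ul * 1e-6
    return moles / liters


def dilute(stock_molar: float, added_ul: float, total_ul: float) -> float:
    """Concentration after adding added_ul of stock into a total volume of total_ul."""
    return stock_molar * added_ul / total_ul


# Assumed well volume: the sum of the four additions listed in the assay steps.
WELL_VOLUME_UL = 40 + 10 + 10 + 40

# 0.1-10 pmol of labeled c-IAP1 per well, expressed as a molar range.
low_molar = molar_from_pmol(0.1, WELL_VOLUME_UL)
high_molar = molar_from_pmol(10, WELL_VOLUME_UL)

# 10 ul of a 10x stock in the assumed 100 ul well is a tenfold dilution.
diluted_low = dilute(1e-8, 10, WELL_VOLUME_UL)

print(f"{low_molar:.1e} M to {high_molar:.1e} M")
```

Under these assumptions, 0.1-10 pmoles per well corresponds to roughly 10⁻⁹ to 10⁻⁷ M, which agrees with the final concentration range given in the assay step.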

D. Controls for all assays (located on each plate):
a. Non-specific binding
b. Soluble (non-biotinylated truncated TRAF2) at 80% inhibition.
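
One way the plate controls might be used when reading out counts is sketched below. This is an illustration under stated assumptions, not the analysis described in the protocol: the function name, the choice to subtract the non-specific binding wells as background, and every count value are hypothetical.

```python
def percent_inhibition(sample_cpm: float, uninhibited_cpm: float, nonspecific_cpm: float) -> float:
    """Percent inhibition of specific binding for one well.

    sample_cpm:       counts in a well containing a candidate compound
    uninhibited_cpm:  counts in a well with no compound (assumed reference)
    nonspecific_cpm:  counts in the non-specific binding control well
    """
    specific_signal = uninhibited_cpm - nonspecific_cpm
    if specific_signal <= 0:
        raise ValueError("uninhibited counts must exceed non-specific counts")
    specific_sample = sample_cpm - nonspecific_cpm
    return 100.0 * (1.0 - specific_sample / specific_signal)


# Hypothetical counts chosen so that the well sits near the 80% inhibition control level.
example = percent_inhibition(sample_cpm=2400, uninhibited_cpm=10000, nonspecific_cpm=500)
print(round(example, 1))
```

A well whose counts equal the non-specific binding control evaluates to 100% inhibition, and a well matching the uninhibited reference evaluates to 0%.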

All publications and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference. Although the
foregoing invention has been described in some detail by way of illustration and
example for purposes of clarity of understanding, it will be readily apparent to those of
ordinary skill in the art in light of the teachings of this invention that certain changes
and modifications may be made thereto without departing from the spirit or scope of
the appended claims.



SEQUENCE ID NO: 1, 2      h (human) c-IAP1           cDNA, protein
SEQUENCE ID NO: 3, 4      h c-IAP2                   cDNA, protein
SEQUENCE ID NO: 5, 6      h c-IAP1, 2 repeat 1       protein, protein
SEQUENCE ID NO: 7, 8      h c-IAP1, 2 repeat 2       protein, protein
SEQUENCE ID NO: 9, 10     h c-IAP1, 2 repeat 3       protein, protein
SEQUENCE ID NO: 11, 12    h c-IAP1, 2 RING finger    protein, protein
SEQUENCE ID NO: 13, 14    m (murine) c-IAP           cDNA, protein

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: TULARIK, INC. 
(ii) TITLE OF INVENTION: INHIBITORS OF APOPTOSIS 
(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

(B) STREET: 4 Embarcadero Center, Suite 3400 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US96 / 

(B) FILING DATE: 06 AUG 1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBERS: U.S. Serial Nos. 08/512,946 & 08/569,749

(B) FILING DATES: 08 AUG 1995 & 08 DEC 1995 

(C) CLASSIFICATION 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Brezner, David J.

(B) REGISTRATION NUMBER: 24,774 

(C) REFERENCE / DOCKET NUMBER: A-62464/DJB 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415)781-1989

(B) TELEFAX: (415)398-3249 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2589 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
TCTAAGTAGT ATCTTGGAAA TTCAGAGAGA TACTCATCCT ACCTGAATAT AAACTGAGAT 60
AAATCCAGTA AAGAAAGTGT AGTAAATTCT ACATAAGAGT CTATCATTGA TTTCTTTTGG 120
TGGTAAAAAT CTTAGTTCAT GTGAAGAAAT TTCATGTGAA TGTTTTAGCT ATCAAACAGC 180
ACTGTCACCT ACTCATGCAC AAAACTGCCT CCCAAAGACT TTTCCCAGGT CCCTCGTATC 240
AAAACATTAA GAGTATAATG GAAGATAGCA CGATCTTGTC AGATTGGACA AACAGCAACA 300
AACAAAAAAT GAAGTATGAC TTTTCCTGTG AACTCTACAG AATGTCTACA TATTCAACTT 360
TCCCCGCCGG GGTGCCTGTC TCAGAAAGGA GTCTTGCTCG TGCTGGTTTT TATTATACTG 420
GTGTGAATGA CAAGGTCAAA TGCTTCTGTT GTGGCCTGAT GCTGGATAAC TGGAAACTAG 480
GAGACAGTCC TATTCAAAAG CATAAACAGC TATATCCTAG CTGTAGCTTT ATTCAGAATC 540


TGGTTTPAGP 

X XJ W XXX w 


TAGTCTGGGA 

x rtw x \_ X uuun 


TPPAPPTPTA 

X L LflV_> V— X V— X r\ 


AGAATAPGTP 

nwnn X AL w X L 


TCCAATGAGA 


A A P A nTTTTfi 
nnvno X X X X w 


O LI \J 


A P ATTP ATT 

LJ^L Ai X X LA X X 


ATPTPPPAPP 

AX L x LLL>r>LL 


TTt^t CI A A P A T A 

X X VjwAA wA X A 


w X AwL X X O X X 


PAPrTP.P.TTPT 

Law X ww X X L X 


TAPTPPArtPP 

X AL X LLawwL 


DOU 


LLaaa 


P PPTPTT A 7A T 


1 V_ X AwAw L AW 


TTn A Af2 AP AT 
1 1 W AAw a La X 


PTPTTP A TPrt 
L 1 L 1 1 La X Lw 


ArtrtA PT A A PP 
awwaL 1 AAL L 


/ ^ u 


pn T A P A f2TT A 
L L 1 iikVH.nO X X a 


TGP A A TG A GT 


7A PTO A 1\ n 7x 7x f3 

Aw 1 OaAwAAW 


PP A n 2V TTTf T 
LLaUaX X X L X 


TAPPTAPPAT 
X AL L X aL La X 


a l\j X bbLLAl 


Ton 
/OU 


TliAPTTTTTT- 

X ArtU X X X X X X 


GTPAPPATPA 
w X v>nv. X V.n 


flAATTf^rtPAA 
Oaa X X ww Laa 


f^ArtPTnftlTTT 

wAwL X ww XXX 


TT ATT AT AT A 
X XaX XaX^^Xa 


CZd A P PTf3f3 A rt 

wOaL L X wO-Aw 




AT AGGGTAG C 


CTGCTTTGCC 

L X OL X X X VJ v_ w 


TGTGGTGGGA 

X w X ww X www^V 


AG CT P AGT AA 

L X Wlw x nn 


CTGGGAACCA 

L X VjVJurtftV. w^l 


AAGC5ATGATG 


900 


C T ATGT r&G A 

w x .A x o X ^AUn 


APAPPGGAGG 

nVv/i^ L O 0/r>0 VJ 


PATTTTPPPA 

WAX X X X W WWA 


APTGTPPATT 

nL X w X L L/l X X 


TTTGGAAAAT 

XXX VJVjnivln X 


TPTPTAfiAAA 

X L X L X AO AAA 




PTPTrtAP.PTT 


TA G P IV TTTC 7a 
X aw Cii XXX La 


AA X w X VjAU W A 


TPiP Art APAPA 
x VjLaUaLaLa 


TP.PAP.PTPP.A 
X uLawL X Lwa 


ATrtArtA AP AT 
a 1 bnbAALA 1 




1 XAiuX aL 1 o 


nr* P A TPT A f2T 
tftUvl L 1 Aw x 


oil LLAul 1 C 


A f2 PPTC2 A f2P 7A 
AwCL 1 wAwLa 


wL 1 1 oLrtAb 1 


ULlbbJ. X x 1 1 


1 AO ft 


ATTAT^TfW^ 


TP n P IV A Tfl A T 


wa 1 w 1 waAAI 


^5^' 1 " 1*'! M I'^!^n r T , ^2. 
wL Xlllbllu 


Trz a r vnn r vnnc 

1 WA X ww 1 wwL 


1 1 WAWW lOll 


1 1 / ft 

114 U 


GGGAATPTGf? 


agatgatppa 

nVJn X \J/i X L La 


X WWW X AwAAW 


ATrtPP AArtTrt 
A X wLLaaU X w 


nTTTPPAAfSn 
w X X X L L aaw w 


X O X O AO X X L X 


i 0 n n 


TGATACGAAT 


GAAAGGPPAA 

UnnnVJVj L L AA 


wnVj X X X VJ X X w 


ATC5ARATTPA 

X OAVJa X X L<a 


ACSGTACJATAT 

Aww X AV3A X A X 


PPTPATPTTP 

LL X L*AX L X X L 


x ^ 0 w 


TTGAACAGCT 

X X ^7f»^»W^%NJW X 


GTTGTPAAPT 
w x x w x wviv> x 


TPAGATAPPA 

X w Aw A X Aw La 


PTGGAGAAGA 

L X Uw'%w'V%w.A 


AAATGPTRAP 

AnnlUL X UAL 


PP APPAATTA 

LLaLLaaX Xa 


X J ^. u 


X X La X X X X ww 


7v rr'Trzrc ag a ta 

ALL 1 VjwAwAA 


AO 1 1 L 1 1 LAVj 


A A n A TrtPTfZT 
AAV3A 1 wL 1 Vj 1 


PATf2ATf2A AT 
LAlbAl waa! 


A P A PPTfiTPrt 
ALALL 1 0 1 OO 


1 TQA 


f"T7\ 7\ T\ »T^/-«fp/-i o 


v- TTGG AAATG 


GG CTTTAAT A 


GAGACCTGGT 


/*i TV TVTV^TV TVTV^»TV 

GAAACAAACA 


pmrpp TV TV TV /"'^PTV 

GTT C AAAGTA 


1440 


^ TV TV m/"-! f-^m/"* » « 

AAAT C CTG AC 


AAC TG G AG AG 


TV TV ^TfT^TV ^rtTV TV TV TV 

AACTATAAAA 


/-t T\ /~i mm TV TV T^/"^ TV 

CAGTTAATGA 


T ATTG TGT CA 


GCACTTCTAA 


1500 




rY"t/*^ TV TV TV TV TV TV TV 

TGAAAAAAGA 


/"*7v /™< tv tv ^» tv 
GAGGAGGAGA 


7\ r^f* T\ Tv TVTVTV^TV 

AG G AAAAAC A 


TV O TV TV TV Tv 

AG CTG AAGAA 


TV *T*^» O TV TV 

ATGGCATCAG 


1560 


A X wA lilblL 


TV ' I 1 1 T\ TV TTPPO 

ATTAATTCGG 


TVTV/"»TVTV^" , TV^ ,, TV TV 

AAGAACAGAA 


TGGCTCTCTT 


fpp TV TV TV TV •! 

1 lAAlAATTvj 


TV ^ TV T 1 /^ T*^ p P/~' 

AC A I GT0T0C 


162 0 




/■^^TV*T*TV TV *T*^* f l"M 

wwATAATCTT 


T x AAAGG C C A 


TVT^T'TV TV' I 1 "I 1 TV TV 

ATGTAA x TAA 


TTVTiTA^TV/^/^TVTA 

A AAAC Aw wAA 


T\ m/-» TV T*TV TTTt 
CAI bAIAI I A 


I60O 


1 1AAALAAAA 


TvT\/* , TVf"'T\/* , TVTTv 

AAC AC AoA I A 


LCI 1 lACAAw 


f*f~* tv tv f+ TV TV 


PATTP ATAPO 
wAl lwAlALL 


Aill Ibbi 1A 


1/4 0 


TA TA TV TV TV H*f^ 


TGCGGCCAAC 


TV iT 1 /**«'T , T 1 ^« Tv TV TV TV 

ATCTTCAAAA 


TV ^"P/^T'/'^'T'TV A TV 

ACTCjTC 1 AAA 


TV r* TA TA TA TT/"* TA 

AvjAAAA 1 bAL 


TCTACATTGT 


IbUU 


7V ^TV TV O TV TV ^T*T^ 

ATAAG AAC TT 


AT TTGTGG AT 


TV TV ^* TV TV ^HTV ^H/^ TV 

AAGAATATGA 


TV /^T 1 7\ mTV iiiiii^O 

AGTATATTCC 


TV TV 7\ 7V TV 7V T> 

AACAGAAGAT 


GTTT CAGGT C 


1860 


TG TC ACTGG A 


AGAACAATTG 


AGG AGGTTG C 


TV TV ^ TV TV TV TV 

AAGAAGAACG 


aacttgtaaa 


<tm^»m^im7V TT*f~* 

G TGTG T ATGG 


192 0 


TA f*TA TV TV/*"< TV TV/^T 1 

ALAAAG AAG T 


TTCTGTTGTA 


TTTATTCCTT 


w I ww I C A x C 1 


ww 1 Aw 1 A 1 wL 


r*TAf , r , TA TV T^T/^ 

LAbbAA lblb 


ISO U 


CCCCTTCTCT 


AAG AAAATG C 


CCTATTTG C A 


G G GGT ATAAT 


/"»TV TV^O/^T'TV / "1* 

CAAGGGTAC1. 


G TT CuT AC AT 


2 04 0 


TTCTCTCTTA 


TV TV TV TV TV TV TV m TV 

AAGAAAAATA 


G T CT AT ATTT 


T 1 TV TV PTP TV nn 

TAACC 1 wCAi 


TA TA TA TA TA nPTPT 

AAAAAbb 1 C 1 


1 1 AAAAl Al 1 


O T A A 


GTTGAACACT TGAAGCCATC TAAAGTAAAA AGGGAATTAT GAGTTTTTCA ATTAGTAACA 2160
TTCATGTTCT AGTCTGCTTT GGTACTAATA ATCTTGTTTC TGAAAAGATG GTATCATATA 2220
TTTAATCTTA ATCTGTTTAT TTACAAGGGA AGATTTATGT TTGGTGAACT ATATTAGTAT 2280
GTATGTGTAC CTAAGGGAGT AGTGTCACTG CTTGTTATGC ATCATTTCAG GAGTTACTGG 2340
ATTTGTTGTT CTTTCAGAAA GCTTTGAATA CTAAATTATA GTGTAGAAAA GAACTGGAAA 2400
CCAGGAACTC TGGAGTTCAT CAGAGTTATG GTGCCGAATT GTCTTTGGTG CTTTTCACTT 2460
GTGTTTTAAA ATAAGGATTT TTCTCTTATT TCTCCCCCTA GTTTGTGAGA AACATCTCAA 2520



TAAAGTGCTT TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2580
AAAAAAAAA 2589

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 618 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met His Lys Thr Ala Ser Gln Arg Leu Phe Pro Gly Pro Ser Tyr Gln
1 5 10 15

Asn Ile Lys Ser Ile Met Glu Asp Ser Thr Ile Leu Ser Asp Trp Thr
20 25 30

Asn Ser Asn Lys Gln Lys Met Lys Tyr Asp Phe Ser Cys Glu Leu Tyr
35 40 45

Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro Val Ser Glu
50 55 60

Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys
65 70 75 80

Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Leu Gly
85 90 95

Asp Ser Pro Ile Gln Lys His Lys Gln Leu Tyr Pro Ser Cys Ser Phe
100 105 110

Ile Gln Asn Leu Val Ser Ala Ser Leu Gly Ser Thr Ser Lys Asn Thr
115 120 125

Ser Pro Met Arg Asn Ser Phe Ala His Ser Leu Ser Pro Thr Leu Glu
130 135 140

His Ser Ser Leu Phe Ser Gly Ser Tyr Ser Ser Leu Ser Pro Asn Pro
145 150 155 160

Leu Asn Ser Arg Ala Val Glu Asp Ile Ser Ser Ser Arg Thr Asn Pro
165 170 175

Tyr Ser Tyr Ala Met Ser Thr Glu Glu Ala Arg Phe Leu Thr Tyr His
180 185 190

Met Trp Pro Leu Thr Phe Leu Ser Pro Ser Glu Leu Ala Arg Ala Gly
195 200 205

Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys Gly
210 215 220

Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asp Ala Met Ser Glu His
225 230 235 240

Arg Arg His Phe Pro Asn Cys Pro Phe Leu Glu Asn Ser Leu Glu Thr
245 250 255

Leu Arg Phe Ser Ile Ser Asn Leu Ser Met Gln Thr His Ala Ala Arg
260 265 270

Met Arg Thr Phe Met Tyr Trp Pro Ser Ser Val Pro Val Gln Pro Glu
275 280 285

Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg Asn Asp Asp Val
290 295 300

Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser Gly Asp
305 310 315 320

Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg Cys Glu Phe Leu
325 330 335

Ile Arg Met Lys Gly Gln Glu Phe Val Asp Glu Ile Gln Gly Arg Tyr
340 345 350

Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Thr Thr Gly Glu
355 360 365

Glu Asn Ala Asp Pro Pro Ile Ile His Phe Gly Pro Gly Glu Ser Ser
370 375 380

Ser Glu Asp Ala Val Met Met Asn Thr Pro Val Val Lys Ser Ala Leu
385 390 395 400

Glu Met Gly Phe Asn Arg Asp Leu Val Lys Gln Thr Val Gln Ser Lys
405 410 415

Ile Leu Thr Thr Gly Glu Asn Tyr Lys Thr Val Asn Asp Ile Val Ser
420 425 430

Ala Leu Leu Asn Ala Glu Asp Glu Lys Arg Glu Glu Glu Lys Glu Lys
435 440 445

Gln Ala Glu Glu Met Ala Ser Asp Asp Leu Ser Leu Ile Arg Lys Asn
450 455 460

Arg Met Ala Leu Phe Gln Gln Leu Thr Cys Val Leu Pro Ile Leu Asp
465 470 475 480

Asn Leu Leu Lys Ala Asn Val Ile Asn Lys Gln Glu His Asp Ile Ile
485 490 495

Lys Gln Lys Thr Gln Ile Pro Leu Gln Ala Arg Glu Leu Ile Asp Thr
500 505 510

Ile Leu Val Lys Gly Asn Ala Ala Ala Asn Ile Phe Lys Asn Cys Leu
515 520 525

Lys Glu Ile Asp Ser Thr Leu Tyr Lys Asn Leu Phe Val Asp Lys Asn
530 535 540

Met Lys Tyr Ile Pro Thr Glu Asp Val Ser Gly Leu Ser Leu Glu Glu
545 550 555 560

Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys Met Asp
565 570 575

Lys Glu Val Ser Val Val Phe Ile Pro Cys Gly His Leu Val Val Cys
580 585 590

Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg Gly Ile
595 600 605

Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
610 615



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GGGCAGCAGG TTTACAAAGG AGGAAAACGA CTTCTTCTAG ATTTTTTTTT CAGTTTCTTC 60
TATAAATCAA AACTACCTCC CTAGAGAAAG GCTAGTCCCT TTTCTTCCCC ATTCATTTCA 120
TTATGAACAT AGTAGAAAAC AGCATATTCT TATCAAATTT GATGAAAAGC GCCAACACGT 180
TTGAACTGAA ATACGACTTG TCATGTGAAC TGTACCGAAT GTCTACGTAT TCCACTTTTC 240
CTGCTGGGGT CCCTGTCTCA GAAAGGAGTC TTGCTCGCGC TGGTTTCTAT TACACTGGTG 300
TGAATGACAA GGTCAAATGC TTCTGTTGTG GCCTGATGCT GGATAACTGG AAAAGAGGAG 360
ACAGTCCTAC TGAAAAGCAT AAAAAGTTGT ATCCTAGCTG CAGATTCGTT CAGAGTCTAA 420
ATTCCGTTAA CAACTTGGAA GCTACCTCTC AGCCTACTTT TCCTTCTTCA GTAACAAATT 480
CCACACACTC ATTACTTCCG GGTACAGAAA ACAGTGGATA TTTCCGTGGC TCTTATTCAA 540
ACTCTCCATC AAATCCTGTA AACTCCAGAG CAAATCAAGA TTTTTCTGCC TTGATGAGAA 600










ACTTACTTTT CAGACATGGC 660


CATTGACTTT TCTGTCGCCA ACAGATCTGG CAAAAGCAGG CTTTTACTAC ATAGGACCTG 720
GAGACAGAGT GGCTTGCTTT GCCTGTGGTG GAAAATTGAG CAATTGGGAA CCGAAGGATA 780
ATGCTATGTC AGAACACCTG AGACATTTTC CCAAATGCCC ATTTATAGAA AATCAGCTTC 840
AAGACACTTC AAGATACACA GTTTCTAATC TGAGCATGCA GACACATGCA GCCCGCTTTA 900
AAACATTCTT TAACTGGCCC TCTAGTGTTC TAGTTAATCC TGAGCAGCTT GCAAGTGCGG 960
GTTTTTATTA TGTGGGTAAC AGTGATGATG TCAAATGCTT TTGCTGTGAT GGTGGACTCA 1020
GGTGTTGGGA ATCTGGAGAT GATCCATGGG TTCAACATGC CAAGTGGTTT CCAAGGTGTG 1080
AGTACTTGAT AAGAATTAAA GGACAGGAGT TCATCCGTCA AGTTCAAGCC AGTTACCCTC 1140
ATCTACTTGA ACAGCTGCTA TCCACATCAG ACAGCCCAGG AGATGAAAAT GCAGAGTCAT 1200
CAATTATCCA TTTTGAACCT GGAGAAGACC ATTCAGAAGA TGCAATCATG ATGAATACTC 1260
CTGTGATTAA TGCTGCCGTG GAAATGGGCT TTAGTAGAAG CCTGGTAAAA CAGACAGTTC 1320
AGAGAAAAAT CCTAGCAACT GGAGAGAATT ATAGACTAGT CAATGATCTT GTGTTAGACT 1380
TACTCAATGC AGAAGATGAA ATAAGGGAAG AGGAGAGAGA AAGAGCAACT GAGGAAAAAG 1440
AATCAAATGA TTTATTATTA ATCCGGAAGA ATAGAATGGC ACTTTTTCAA CATTTGACTT 1500
GTGTAATTCC AATCCTGGAT AGTCTACTAA CTGCCGGAAT TATTAATGAA CAAGAACATG 1560
ATGTTATTAA ACAGAAGACA CAGACGTCTT TACAAGCAAG AGAACTGATT GATACGATTT 1620
TAGTAAAAGG AAATATTGCA GCCACTGTAT TCAGAAACTC TCTGCAAGAA GCTGAAGCTG 1680
TGTTATATGA GCATTTATTT GTGCAACAGG ACATAAAATA TATTCCCACA GAAGATGTTT 1740
CAGATCTACC AGTGGAAGAA CAATTGCGGA GACTACAAGA AGAAAGAACA TGTAAAGTGT 1800
GTATGGACAA AGAAGTGTCC ATAGTGTTTA TTCCTTGTGG TCATCTAGTA GTATGCAAAG 1860
ATTGTGCTCC TTCTTTAAGA AAGTGTCCTA TTTGTAGGAG TACAATCAAG GGTACAGTTC 1920
GTACATTTCT TTCATGAAGA AGAACCAAAA CATCATCTAA ACTTTAGAAT TAATTTATTA 1980
AATGTATTAT AACTTTAACT TTCATCCTAA TTTGGTTTCC TTAAAATTTT TATTTATTTA 2040
CAACTCAACA AACATTGTTT TGTGTAACAT ATTTAATATA TGTATCTAAA CCATATGAAC 2100
ATATATTTTT TAGAAACTAA GAGAATGATA GGCTTTTGTT CTTATGAACG AAAAAGAGGT 2160
AGCACTACAA ACACAATATT CAATCAAAAT TTCAGCATTA TTGAAATTGT AAGTGAAGTA 2220
AAACTTAAGA TATTTGAGTT AACCTTTAAG AATTTTAAAT ATTTTGGCAT TGTACTAATA 2280
CCGGGAACAT GAAGCCAGGT GTGGTGGTAT GTGCCTGTAG TCCCAGGCTG AGGCAAGAGA 2340
ATTACTTGAG CCCAGGAGTT TGAATCCATC CTGGGCAGCA TACTGAGACC CTGCCTTTAA 2400


AAACAAACAG AACAAAAACA AAACACCAGG GACACATTTC TCTGTCTTTT TTGATC-Ava J. \a




TCCTATACAT CGAAGGTGTG CATATATGTT GAATGACATT TTAGGGACAT GGTGTTTTTA 2520
TAAAGAATTC TGTGAGAAAA AATTTAATAA AACCCCCCAA ATTAAAAAAA AAAAAAAAAA 2580
AAAAAAAAAA AAAAAAAAAA A 2601



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 604 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Asn Ile Val Glu Asn Ser Ile Phe Leu Ser Asn Leu Met Lys Ser
1 5 10 15

Ala Asn Thr Phe Glu Leu Lys Tyr Asp Leu Ser Cys Glu Leu Tyr Arg
20 25 30

Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro Val Ser Glu Arg
35 40 45

Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys Val
50 55 60

Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Arg Gly Asp
65 70 75 80

Ser Pro Thr Glu Lys His Lys Lys Leu Tyr Pro Ser Cys Arg Phe Val
85 90 95

Gln Ser Leu Asn Ser Val Asn Asn Leu Glu Ala Thr Ser Gln Pro Thr
100 105 110

Phe Pro Ser Ser Val Thr Asn Ser Thr His Ser Leu Leu Pro Gly Thr
115 120 125

Glu Asn Ser Gly Tyr Phe Arg Gly Ser Tyr Ser Asn Ser Pro Ser Asn
130 135 140

Pro Val Asn Ser Arg Ala Asn Gln Asp Phe Ser Ala Leu Met Arg Ser
145 150 155 160

Ser Tyr His Cys Ala Met Asn Asn Glu Asn Ala Arg Leu Leu Thr Phe
165 170 175

Gln Thr Trp Pro Leu Thr Phe Leu Ser Pro Thr Asp Leu Ala Lys Ala
180 185 190

Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys
195 200 205

Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asn Ala Met Ser Glu
210 215 220

His Leu Arg His Phe Pro Lys Cys Pro Phe Ile Glu Asn Gln Leu Gln
225 230 235 240

Asp Thr Ser Arg Tyr Thr Val Ser Asn Leu Ser Met Gln Thr His Ala
245 250 255

Ala Arg Phe Lys Thr Phe Phe Asn Trp Pro Ser Ser Val Leu Val Asn
260 265 270

Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn Ser Asp
275 280 285

Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser
290 295 300

Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg Cys Glu
305 310 315 320

Tyr Leu Ile Arg Ile Lys Gly Gln Glu Phe Ile Arg Gln Val Gln Ala
325 330 335

Ser Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Ser Pro
340 345 350

Gly Asp Glu Asn Ala Glu Ser Ser Ile Ile His Phe Glu Pro Gly Glu
355 360 365

Asp His Ser Glu Asp Ala Ile Met Met Asn Thr Pro Val Ile Asn Ala
370 375 380

Ala Val Glu Met Gly Phe Ser Arg Ser Leu Val Lys Gln Thr Val Gln
385 390 395 400

Arg Lys Ile Leu Ala Thr Gly Glu Asn Tyr Arg Leu Val Asn Asp Leu
405 410 415

Val Leu Asp Leu Leu Asn Ala Glu Asp Glu Ile Arg Glu Glu Glu Arg
420 425 430

Glu Arg Ala Thr Glu Glu Lys Glu Ser Asn Asp Leu Leu Leu Ile Arg
435 440 445

Lys Asn Arg Met Ala Leu Phe Gln His Leu Thr Cys Val Ile Pro Ile
450 455 460

Leu Asp Ser Leu Leu Thr Ala Gly Ile Ile Asn Glu Gln Glu His Asp
465 470 475 480

Val Ile Lys Gln Lys Thr Gln Thr Ser Leu Gln Ala Arg Glu Leu Ile
485 490 495

Asp Thr Ile Leu Val Lys Gly Asn Ile Ala Ala Thr Val Phe Arg Asn
500 505 510

Ser Leu Gln Glu Ala Glu Ala Val Leu Tyr Glu His Leu Phe Val Gln
515 520 525

Gln Asp Ile Lys Tyr Ile Pro Thr Glu Asp Val Ser Asp Leu Pro Val
530 535 540

Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys
545 550 555 560

Met Asp Lys Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val
565 570 575

Val Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg
580 585 590

Ser Thr Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
595 600

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 55 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Cys Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val 
1 5 10 15 

Pro Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly 
20 25 30 

Val Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn 
35 40 45 

Trp Lys Leu Gly Asp Ser Pro 
50 55 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 55 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Cys Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val
1 5 10 15

Pro Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly 
20 25 30 

Val Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn 
35 40 45 

Trp Lys Arg Gly Asp Ser Pro 
50 55 
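
SEQ ID NO: 5 and SEQ ID NO: 6 can be compared residue by residue to locate where the two repeat 1 sequences diverge. The following sketch is illustrative only; the helper name `differences` is hypothetical, and the residue strings are transcribed from the two listings above.

```python
SEQ_ID_NO_5 = (
    "Cys Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val "
    "Pro Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly "
    "Val Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn "
    "Trp Lys Leu Gly Asp Ser Pro"
).split()

SEQ_ID_NO_6 = (
    "Cys Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val "
    "Pro Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly "
    "Val Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn "
    "Trp Lys Arg Gly Asp Ser Pro"
).split()


def differences(first, second):
    """Return (position, residue_in_first, residue_in_second) for each mismatch.

    Positions are 1-based, matching the numbering used in the sequence listing.
    """
    if len(first) != len(second):
        raise ValueError("sequences must have the same length")
    return [
        (position, a, b)
        for position, (a, b) in enumerate(zip(first, second), start=1)
        if a != b
    ]


print(differences(SEQ_ID_NO_5, SEQ_ID_NO_6))
```

Run on the two 55-residue listings, the comparison reports a single mismatch at position 51 (Leu in SEQ ID NO: 5, Arg in SEQ ID NO: 6).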

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 46 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala
1 5 10 15

Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asp 
20 25 30 

Ala Met Ser Glu His Arg Arg His Phe Pro Asn Cys Pro Phe 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 46 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Leu Ala Lys Ala Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala
1 5 10 15

Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asn 
20 25 30 

Ala Met Ser Glu His Leu Arg His Phe Pro Lys Cys Pro Phe 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg Asn Asp
1 5 10 15

Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser 
20 25 30 

Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg Cys Glu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn Ser Asp
1 5 10 15

Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser
20 25 30

Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg Cys Glu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Glu Arg Thr Cys Lys Val Cys Met Asp Lys Glu Val Ser Val Val
1 5 10 15

Phe Ile Pro Cys Gly His Leu Val Val Cys Gln Glu Cys Ala Pro Ser
20 25 30

Leu Arg Lys Cys Pro Ile Cys Arg Gly Ile Ile Lys Gly Thr Val Arg
35 40 45

Thr Phe Leu Ser
50

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Glu Glu Arg Thr Cys Lys Val Cys Met Asp Lys Glu Val Ser Ile Val
1 5 10 15

Phe Ile Pro Cys Gly His Leu Val Val Cys Lys Asp Cys Ala Pro Ser
20 25 30

Leu Arg Lys Cys Pro Ile Cys Arg Ser Thr Ile Lys Gly Thr Val Arg
35 40 45

Thr Phe Leu Ser 
50 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TTCCTTTACA GTGAATACTG TAGTCTTAAT AGACCTGAGC TGACTGCTGC AGTTGATGTA 60
ACCCACTTTA GAGAATACTG TATGACATCT TCTCTAAGGA AAACCAGCTG CAGACTTCAC 120
TCAGTTCCTT TCATTTCATA GGAAAAGGAG TAGTTCAGAT GTCATGTTTA AGTCCTTATA 180
AGGGAAAAGA GCCTGAATAT ATGCCCTAGT ACCTAGGCTT CATAACTAGT AATAAGAAGT 240
TAGTTATGGG TAAATAGATC TCAGGTTACC CAGAAGAGTT CATGTGACCC CCAAAGAGTC 300
CTAACTAGTG TCTTGGCAAG TGAGACAGAT TTGTCCTGTG AGGGTGTCAA TTCACCAGTC 360
CAAGCAGAAG ACAATGAATC TATCCAGTCA GGTGTCTGTG GTGGAGATCT AGTGTCAAGT 420
GGTGAGAAAC TTCATCTGGA AGTTTAAGCG GTCAGAAATA CTATTACTAC TCATGGACAA 480
AACTGTCTCC CAGAGACTCG GCCAAGGTAC CTTACACCAA AAACTTAAAC GTATAATGGA 540
GAAGAGCACA ATCTTGTCAA ATTGGACAAA GGAGAGCGAA GAAAAAATGA AGTTTGACTT 600
TTCGTGTGAA CTCTACCGAA TGTCTACATA TTCAGCTTTT CCCAGGGGAG TTCCTGTCTC 660
AGAGAGGAGT CTGGCTCGTG CTGGCTTTTA TTATACAGGT GTGAATGACA AAGTCAAGTG 720
CTTCTGCTGT GGCCTGATGT TGGATAACTG GAAACAAGGG GACAGTCCTG TTGAAAAGCA 780
CAGACAGTTC TATCCCAGCT GCAGCTTTGT ACAGACTCTG CTTTCAGCCA GTCTGCAGTC 840
TCCATCTAAG AATATGTCTC CTGTGAAAAG TAGATTTGCA CATTCGTCAC CTCTGGAACG 900
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AGGTGG C ATT 


GAPTCCAACC 


TGTGCTCTAG 


CC CTCTTAAT 

^« ^« ^» X X* X X X 


TCTAGAGCAG 


TGGAAGAPTT 

a uvinnunv. x a 


960 


CTCATCAAGG ATGGATCCCT GCAGCTATGC CATGAGTACA GAAGAGGCCA GATTTCTTAC 1020
TTACAGTATG TGGCCTTTAA GTTTTCTGTC ACCAGCAGAG CTGGCCAGAG CTGGCTTCTA 1080
TTACATAGGG CCTGGAGACA GGGTGGCCTG TTTTGCCTGT GGTGGGAAAC TGAGCAACTG 1140
GGAACCAAAG GATGATGCTA TGTCAGAGCA CCGCAGACAT TTTCCCCACT GTCCATTTCT 1200
GGAAAATACT TCAGAAACAC AGAGGTTTAG TATATCAAAT CTAAGTATGC AGACACACTC 1260


TGCTCGATTG 


AGGACATTTC 


TGTAPTGG C C 

A vj A A V7w W W 


ACCTAGTGTT 


P CTG TT C AG C 


CCGAGCAGCT 

w w wn\7 vnu W A 


1320 


TGCAAGTGCT 


GG ATTC TATT 


ACGTGGATCG 

rivv a wwn^ wvj 


CAATGATGAT 


GTCAAGTGCT 


TTTGTTGTGA 

AAA W A A A 


138 0 


TGGTGGCTTG 


AGATGTTGGG 

^»S^«*X ^* X A WWW 


AAC CTGGAG A 


TGACCCCTGG 


ATAGAACACG 


C CAAATGGTT 


1440 


TCCAAGGTGT 


GAGTTCTTGA 


TACGGATGAA 


GGGTCAGGAG 


TTTGTTGATG 


AGATTCAAGC 


1500 


TAGATATCCT 


C AT CTT CTTG 


AGCAGCTGTT 


GTCCACTTCA 


GACACCCCAG 


GAGAAGAAAA 


1560 


TGCTGACCCT 


ACAGAGACAG 


TGGTGCATTT 

X X X X X 


TGGCCCTGGA 


GAAAGTTCGG 


AAGATGTCGT 


1620 


PATGATPAPP 


A PPPPTPTPP 

4"1V_ WW Ui\3l OO 


TTA AAPPAPP 
X Xi"U*lftww<nww 


PTTGGAAATG 

Vv X X UUnnn X Vj 


GGPTTPAGTA 

www X X w4"lw X 


GPAPPPTPPT 
uuno l. v». x vjo x 


1 ft o 

X. O O IS 


GAPAPAPAPP 


PTTPAPPPPP 


APATPPTPPP 

nun X V.V. X V3V3V. 


PAPTGGTGAG 

UnL X wVJ X OriVJ 


AAPTAPAGGA 


PPPTPAATPA 


X / ** VJ 


TATTGTPTPA 


PTAPT'PT'PP A 

vj X X X X X VJri 


ATGPTGAAGA 


TGAGAGAAGA 

X VJnVJnOnnwn 


GAAGAGGAGA 


AGGAAAPAPA 


T ft 0 0 


GACTGAAGAG 


ATGGCATCAG 


GTG APTTAT C 


ACTGATTCGG 


AAGAATAGAA 


TGGCCCTCTT 


1860 


TCAACAGTTG 


ACACATGTPC 


TTPPTATCCT 

A A A A V«w A 


GGATAATCTT 


CTTGAGGCCA 


GTGTAATTAC 

X X nOi X X 


1920 


AAAAPAGGAA 


PATPATATTA 

w*n X V3A X n X X n 


TTAGAPAGAA 


AAPAPAGATA 


CCCTTACAAG 


PAAGAGAGPT 


1980 


t a ttc: a p a p p 


P 1 l"l"l M l' 7A ftTP 7A 
Ol 1 1 1 Aw 1 wA 


71 ZV 71 IV Tf! P 
nUuUnnn X Vj v_ 


TP.P2VP.PPAAP 


ATPTTPAAAA 


APTPTPTPAA 


9 rid. n 


UwAAAl 1 wAL- 


I wUAwl? 1 1A1 


TV TP TV TV TV TV PTT 

A 1 LjAAAAL. 1 1 


TA TTTP TflT 7A TA 
All 1 Vj 1 VjVj AA 


71 71 P 71 71 T 71 TP 71 
AAvj AA 1 A 1 A 


71PT71T71TTPP 
nb 1 A 1 A 1 X 


^ X uu 


7i7iP7AP7iTAP7\p 

AAV_AwAAVjA\_ 


ulll wAV3v?w 1 


1 vj 1 v»A 1 1 uV3>i 


71P.71P.P7iP.TTP. 
AVj AVj L/iU i llj 


PPP 71 P 71 TT 71 P 


71 71 P 71 71 P 71 71 P P 




AAv_ X X Vjv_AAA 


o x oiuiAlvjb 


TV TV TAP TAP.PT 


1 1L1A1 Iblo 


TTP7A TTPPP.T 
X X UnX X X 


PTPPTP71TPT 
w 1 V3b 1 wA iLl 


*5 *> o n 


Aw x Aw IL1 LaL- 


LAubAA lulu 


PPPPTTPTPT 


TV TVppTV 7AP.TP.P 
AAuvjAAo loL 


PPP7ATPTPP71 


Pr2PPr27lP71 TAT 




/""» A TA PPPP TV r^T* 
LAAwwwwAL. 1 


p tp pp p t\ p tv t 


1 1L1L1 Ln 1 Vj 


71 PTP 71 71 P. 71 71 T 
Avj 1 oAAVjAA 1 


PPTPTP71 71 71 P 
Vjw 1V.1 V3/virtVJ 


1A1 1 w 1 1 VjvjA 




p 2i t p 7v p. a 7v p p 


T P T P 7A P TV TV P 7A 


TA 7AP 71 7ATP.7A 71 P 


T 71 PTP 71 TTTP 
X nv. X Vj/-V> 1 1 XL 


APPTPTTPAP 
/iVJv Xv.X 1 v»nu 


PAPPAPATTP 
LnUUnWt X X\. 


^ *± \j \j 


1 Aw ILltlil 


PTV T1PT1TTT1PT 

LAAuAl 1 Aw 1 


TA TATPTTPPTT 
AA1 LI 1 UL 1 1 


T71TP71 71PPPT 
1 A 1 o/vivjvju 1 


APPATTPTAT 
rVO V_ A X XUIXaX 


7A TTT 71 71 f2PTT 
n X X X niiuv X X 




ta c^tptpttp p 


7A. 7APPP TA APPT 


PT TA TPPTP.TT 


P 71 PP T 71 P 71 CZCZ 


71PTPTPTPTP 
XuXVjXVXU 


TTPPAPAPPZV 
X X v*vnv?nVJ\«n 


^ 3 ^ w 


pp 7A p TTrr p ta 
ubAb 1 1 w\jwA 


TGCTTGCTGT 


TA TP TPPTTP 71 


PP 71 P TTPTTP 
uuHV. X XL X Xu 


PATTTPPAAT 

\3r\ XXX w X 


TTPTG AZVAP P 

X X w X VJnnnVj U 


^ O O w 


TTTPP Ti TTP TV 
111 ubA 1 1 l_ A 


Kate J/wA X 1 ww 


yv PPTP71 P71 71 71 
AuL 1 L-AVjAAA 


TP PTP 71 71 71 PP 
XLLX On/vlVv V_ 


APTPPPTPTP 


PTAPTPAPTA 
w X av« X X n 




GTTAGGGTAC CCTGTGCTTC TTGGTGCTTT TCCTTTCTGG AAAATAAGGA TTTTTCTGCT 2700
ACTGGTAAAT ATTTTCTGTT TGTGAGAAAT ATATTAAAGT GTTTCTTTTA AAGGCGTGCA 2760
TCATTGTAGT GTGTGCAGGG ATGTATGCAG GCAAAACACT GTGTATATAA TAAATAAATC 2820
TTTTTAAAAA GTGAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2862

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 612 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Asp Lys Thr Val Ser Gln Arg Leu Gly Gln Gly Thr Leu His Gln
1 5 10 15

Lys Leu Lys Arg Ile Met Glu Lys Ser Thr Ile Leu Ser Asn Trp Thr
20 25 30

Lys Glu Ser Glu Glu Lys Met Lys Phe Asp Phe Ser Cys Glu Leu Tyr
35 40 45

Arg Met Ser Thr Tyr Ser Ala Phe Pro Arg Gly Val Pro Val Ser Glu
50 55 60

Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys
65 70 75 80

Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Gln Gly
85 90 95

Asp Ser Pro Val Glu Lys His Arg Gln Phe Tyr Pro Ser Cys Ser Phe
100 105 110

Val Gln Thr Leu Leu Ser Ala Ser Leu Gln Ser Pro Ser Lys Asn Met
115 120 125

Ser Pro Val Lys Ser Arg Phe Ala His Ser Ser Pro Leu Glu Arg Gly
130 135 140

Gly Ile His Ser Asn Leu Cys Ser Ser Pro Leu Asn Ser Arg Ala Val
145 150 155 160

Glu Asp Phe Ser Ser Arg Met Asp Pro Cys Ser Tyr Ala Met Ser Thr
165 170 175

Glu Glu Ala Arg Phe Leu Thr Tyr Ser Met Trp Pro Leu Ser Phe Leu
180 185 190

Ser Pro Ala Glu Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
195 200 205

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu
210 215 220

Pro Lys Asp Asp Ala Met Ser Glu His Arg Arg His Phe Pro His Cys
225 230 235 240

Pro Phe Leu Glu Asn Thr Ser Glu Thr Gln Arg Phe Ser Ile Ser Asn
245 250 255

Leu Ser Met Gln Thr His Ser Ala Arg Leu Arg Thr Phe Leu Tyr Trp
260 265 270

Pro Pro Ser Val Pro Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe
275 280 285
Tyr Tyr Val Asp Arg Asn Asp Asp Val Lys Cys Phe Cys Cys Asp Gly
290 295 300

Gly Leu Arg Cys Trp Glu Pro Gly Asp Asp Pro Trp Ile Glu His Ala
305 310 315 320

Lys Trp Phe Pro Arg Cys Glu Phe Leu Ile Arg Met Lys Gly Gln Glu
325 330 335

Phe Val Asp Glu Ile Gln Ala Arg Tyr Pro His Leu Leu Glu Gln Leu
340 345 350

Leu Ser Thr Ser Asp Thr Pro Gly Glu Glu Asn Ala Asp Pro Thr Glu
355 360 365

Thr Val Val His Phe Gly Pro Gly Glu Ser Ser Glu Asp Val Val Met
370 375 380

Met Ser Thr Pro Val Val Lys Ala Ala Leu Glu Met Gly Phe Ser Arg
385 390 395 400

Ser Leu Val Arg Gln Thr Val Gln Arg Gln Ile Leu Ala Thr Gly Glu
405 410 415

Asn Tyr Arg Thr Val Asn Asp Ile Val Ser Val Leu Leu Asn Ala Glu
420 425 430

Asp Glu Arg Arg Glu Glu Glu Lys Glu Arg Gln Thr Glu Glu Met Ala
435 440 445

Ser Gly Asp Leu Ser Leu Ile Arg Lys Asn Arg Met Ala Leu Phe Gln
450 455 460

Gln Leu Thr His Val Leu Pro Ile Leu Asp Asn Leu Leu Glu Ala Ser
465 470 475 480

Val Ile Thr Lys Gln Glu His Asp Ile Ile Arg Gln Lys Thr Gln Ile
485 490 495

Pro Leu Gln Ala Arg Glu Leu Ile Asp Thr Val Leu Val Lys Gly Asn
500 505 510

Ala Ala Ala Asn Ile Phe Lys Asn Ser Leu Lys Glu Ile Asp Ser Thr
515 520 525

Leu Tyr Glu Asn Leu Phe Val Glu Lys Asn Met Lys Tyr Ile Pro Thr
530 535 540

Glu Asp Val Ser Gly Leu Ser Leu Glu Glu Gln Leu Arg Arg Leu Gln
545 550 555 560

Glu Glu Arg Thr Cys Lys Val Cys Met Asp Arg Glu Val Ser Ile Val
565 570 575

Phe Ile Pro Cys Gly His Leu Val Val Cys Gln Glu Cys Ala Pro Ser
580 585 590

Leu Arg Lys Cys Pro Ile Cys Arg Gly Thr Ile Lys Gly Thr Val Arg
595 600 605

Thr Phe Leu Ser
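
For readers who wish to handle the amino acid sequence of SEQ ID NO: 14 electronically, the following is a minimal illustrative sketch, not part of the sequence listing, of converting three-letter residue codes to the conventional one-letter codes. The names THREE_TO_ONE and to_one_letter are hypothetical, and the example input is the first row of the listing written with the standard codes. Unrecognized codes raise an error, which may help catch transcription mistakes.

```python
# Illustrative helper only; the names here are hypothetical and are not
# part of the sequence listing.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_text):
    """Convert whitespace-separated three-letter codes to a one-letter string.

    Purely numeric tokens (residue position labels) are skipped.
    """
    residues = []
    for token in three_letter_text.split():
        if token.isdigit():
            continue  # position label such as "10" or "15"
        if token not in THREE_TO_ONE:
            raise ValueError(f"unrecognized residue code: {token!r}")
        residues.append(THREE_TO_ONE[token])
    return "".join(residues)


# First row of SEQ ID NO: 14 (residues 1 to 16).
first_row = "Met Asp Lys Thr Val Ser Gln Arg Leu Gly Gln Gly Thr Leu His Gln"
print(to_one_letter(first_row))
```

Applied to the full listing, the length of the returned string can be compared with the stated length of 612 amino acids as a consistency check.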
WHAT IS CLAIMED IS:



1. An isolated nucleic acid having the sequence defined by SEQUENCE ID NO:
1 or 3, or a fragment thereof capable of specifically hybridizing with a nucleic acid
having the sequence defined by SEQUENCE ID NO: 1 or 3 under stringency
conditions defined by a hybridization buffer comprising 30% formamide in 5 x SSPE
(0.18 M NaCl, 0.01 M NaPO4, pH 7.7, 0.001 M EDTA) buffer at a temperature of
42°C and remaining bound when subject to washing at 42°C with 0.2 x SSPE.

2. An isolated nucleic acid according to claim 1 capable of specifically
hybridizing with a nucleic acid having the sequence defined by SEQUENCE ID NO: 1
or 3 under stringency conditions defined by a hybridization buffer comprising 50%
formamide in 5 x SSPE buffer at a temperature of 42°C and remaining bound when
subject to washing at 42°C with 0.2 x SSPE buffer.

3. An isolated nucleic acid according to claim 1 encoding a human cellular
inhibitor of apoptosis protein (c-IAP) comprising at least two of: a first domain
comprising SEQUENCE ID NO: 5 or 6, a second domain comprising SEQUENCE ID
NO: 7 or 8, and a third domain comprising SEQUENCE ID NO: 9 or 10; said protein
having a c-IAP specific activity.



4. A method of making a human cellular inhibitor of apoptosis protein (c-IAP)
comprising introducing a nucleic acid according to claim 3 into a host cell, growing
said host cell under conditions whereby said nucleic acid is expressed as a transcript
and said transcript is expressed as a translation product comprising a cellular inhibitor
of apoptosis protein, and isolating said translation product.



5. A method of identifying lead compounds for a pharmacological agent useful in
the diagnosis or treatment of disease, said method comprising the steps of:
incubating a mixture comprising:

a human c-IAP made by a method according to claim 4,
a natural intracellular human c-IAP binding target, wherein said
binding target is capable of specifically binding said human c-IAP, and
a candidate pharmacological agent;
under conditions whereby, but for the presence of said candidate
pharmacological agent, said human c-IAP specifically binds said binding target at a
reference affinity;

detecting the binding affinity of said human c-IAP to said binding target to
determine an agent-biased affinity,

wherein a difference between the agent-biased affinity and the reference affinity
indicates that said candidate pharmacological agent is a lead compound for a
pharmacological agent capable of modulating human c-IAP-dependent signal
transduction.

6. A method according to claim 5, wherein said human c-IAP binding target
comprises a TRAP or an intracellular fragment of a TRAP sufficient to provide for
c-IAP-specific binding.

7. A method of modulating apoptosis regulation in a cell comprising introducing
into said cell a nucleic acid according to claim 1 whereby said nucleic acid is
expressed in said cell and the resultant gene product modulates apoptosis regulation in
said cell.

8. A method of modulating apoptosis regulation in a cell comprising introducing
into said cell a nucleic acid according to claim 3 whereby said nucleic acid is
expressed in said cell and the resultant gene product modulates apoptosis regulation in
said cell.

9. A method according to claim 8 wherein said cell expresses a recombinant
protein in in vitro culture and said gene product inhibits apoptosis in said cell,
whereby the yield of said recombinant protein is increased.
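
By way of illustration only, the salt concentrations implied by the stringency conditions recited in claims 1 and 2 can be worked out as in the sketch below. It assumes that the parenthetical in claim 1 (0.18 M NaCl, 0.01 M NaPO4, 0.001 M EDTA) gives the 1 x SSPE composition, as is conventional; that reading is an assumption and is not stated in the claims. The names SSPE_1X_MOLAR and sspe_concentrations are hypothetical.

```python
# Assumption: the parenthetical in claim 1 describes 1 x SSPE (molar values).
SSPE_1X_MOLAR = {"NaCl": 0.18, "NaPO4": 0.01, "EDTA": 0.001}


def sspe_concentrations(fold):
    """Return the molar concentration of each component at the given fold strength."""
    return {name: round(molar * fold, 6) for name, molar in SSPE_1X_MOLAR.items()}


# Hybridization buffer (5 x SSPE) and wash buffer (0.2 x SSPE), both used at 42 degrees C.
hybridization = sspe_concentrations(5)
wash = sspe_concentrations(0.2)

for label, buffer in (("5 x SSPE", hybridization), ("0.2 x SSPE", wash)):
    parts = ", ".join(f"{molar:g} M {name}" for name, molar in buffer.items())
    print(f"{label}: {parts}")
```

Under this assumption the hybridization buffer contains about 0.9 M NaCl and the wash buffer about 0.036 M NaCl, which illustrates why the wash is the more stringent step.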